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Abstract 

Recent work has found that the propagation of be- 
haviors and sentiments through networks extends in 
ranges up to 2 to 4 degrees of distance. The regu- 
larity with which the same observation is found in 
dissimilar phenomena has been associated with fric- 
tion in the propagation process and the instability of 
link structure that emerges in the dynamic of social 
networks. We study a contagious behavior, the prac- 
tice of retweeting, in a setting where neither of those 
restrictions is present and still found the same result. 

Keywords: Twitter, information difussion, con- 
tagion, social networks, social distance 

1 Introduction 

Recent studies have suggested that obesity^, 
happiness 1 11). smoking [7j and loneliness |3] among 
others [inj can be contagious in social networks with 
up to three degrees of distance. The pattern has been 
named the three degree of influence rule^ . There are 
two parts in this remarkable finding. On one hand, 
there is a claim about causation and; on the other 
hand, there is an empirical observation about the so- 
cial distance at which a phenomena is expected to be 
observed by a given individual. 

An open question still remains regarding the under- 
lying mechanism of transmission in the case of causal 
processes [Ml [12]. For example, in the case of 



obesity, one mechanism that could explain the obser- 
vation of group formation around individuals weight 
is the spread of eating habits. One could consider 
the case of when the friend of your friend starts eat- 
ing some unhealthy food whereby the probability that 
your friend will gain weight increases as a by product 
of their friendship and, later, so does your probability. 
An alternative explanation may rest on individuals 
perception; for instance, an individual could ignore 
that he is unhealthily gaining weight because people 
surrounding him are also becoming fatter. Finally, 
the observation may simply be a correlation result- 
ing from the human inclination to form bonds with 
people that look like themselves. Only well-designed 
experiments will be able to identify the ruling mech- 
anism in each case|16j. 

Nevertheless, the claim regarding social distance is 
a different story. Three degree of influence implies 
that if you pick a randomly selected individual in a 
network and observe her behavior, feelings or prefer- 
ences, you should expect to also observe, with a given 
probability, similar behaviors, feeling or preferences 
in her relevant network at up to between 2 and 4 de- 
grees of separation in most studied phenomenaflO) . 
The regularity with which the same observation is 
found in dissimilar phenomena is, by itself, a black 
box. An explanation consistent with the social con- 
tagion hypothesis would be that the technology to 
produce social contagion is resource intensive (e.g. 
time consuming) and consequently subject to dimin- 
ishing returns to scale. Based on the dynamic of links 



formation, Christakis and Fowler [TO] have suggested 
that given that closer ties tend to be more stable 
than farther ones the set of individuals at greater so- 
cial distances change at a faster rate reducing the 
capacity of the spreading mechanism to act on them. 

It would be interesting to observe a social conta- 
gion phenomenon in a context without those restric- 
tions (i.e one in which friction to spread the contagion 
is negligible and social structure is stable) in order 
to verify whether it spreads to farther distances, as 
would be expected when they are not binding. 

To advance in that direction, we propose to exam- 
ine the information diffusion of messages in the social 
media Twitter as an example of social contagion. In 
Twitter, users share opinions and information with 
other users who decided to follow that users messages 
(tweets). Those tweets cannot be larger than 140 
characters. Other users can resend tweets (retweets) 
to their own followers by pressing a button. As a con- 
sequence, a tweet can travel from user to user. The 
practice of retweeting a tweet happens in a funda- 
mentally stable and friction- free environment. First, 
the dynamic of retweets extends for short periods of 
time after the original message was created [17]; con- 
sequently, the topology of the network surrounding 
a given user will not change significantly during the 
time in which contagion can occurs, and, second, the 
costs to produce a retweet are negligible such that 
the technology of the retweets can be taken as fric- 
tionless. Therefore, given that friction and instability 
of the social network should not be binding restric- 
tions to the spread of contagion, we should expect 
that the spread of tweets along Twitters social graph 
should reach higher social distances than those found 
in social phenomena were those restrictions are sup- 
posedly active. 

Many messages written in Twitter are resent by 
other users without manipulation. However, a user 
who follows two or more of the users who sent the 
message (either the original or a retweet) receives 
it only once. In this way. Twitter Co. avoid fill- 
ing their users timelines who follow two or more of 
those retweeterers with the same repeated message 
again and again. For example, consider a subgraph 
formed by four nodes (userO to userS) where user2 
and userS are followers between them and followers 



of userl. And, suppose that userl follows userO. If 
userl has retweeted a message from userO and, later, 
userS do the same, user2 will only receive the tweet 
through the path userO— >-userl and not through the 
path userO— )-userl— >-user3. Consequently, the dis- 
tance travelled by a given tweet along the underly- 
ing social graph connecting twitter accounts can be 
tracked. 

We will call the longest path between the original 
sender and the farthest retweeter the degree of influ- 
ence on Twitter to keep in mind the connection with 
Christakis and Fowler, although we are aware that 
the measure of influence on Twitter has been a mat- 
ter of controversy and it is still an open question [1] . 
However, our focus is not the analysis of influence by 
itself but the measuring of the social distance reached 
by a frictionless social contagion process. 

2 Methodology and Data 

For access to the data. Twitter offers an Application 
Programming Interface (API) split in three formats: 
Streaming API for real-time tweets, the Search API 
for past tweets and the REST API for specific queries 
about tweets and users. The first ones does not have 
important restrictions in the amount of queries, how- 
ever, the third one has a limit of 350 calls per hour. 
For this reason, we complement the REST API calls 
with an external proxy service called Apigee, that 
allowed us to continue performing queries after the 
depletion of our API calls on the official Twitter ser- 
vices. However, as a proxy service, the calls are slower 
than with the original API. 

For the purpose of this study, we collected tweets, 
their retweets and the social graph connecting the 
accounts of tweeters (the authors of original mes- 
sages) and retweeters (the re-senders of those mes- 
sages) according to the follower/following relation- 
ship between the users. The methodology was the 
following. 

First, using Twitter Streaming API (Application 
Programming Interface) , we obtained 3 million tweets 
in a real-time period of 24 hours (2012/06/01), col- 
lecting information about its sender, the time at 
which was created, among other information. As 
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mentioned above, in a network, one potential rea- 
son for the declining diffusion of a behavior at far- 
ther distances is the the higher instability of social 
structure as distance increases from a given individ- 
ual. To control for that eventuality, it was required 
to recover the social graph of a retweeted tweet in a 
short period of time. Unfortunately, Twitter imposes 
rate-limit to access to the REST API services used 
for this purpose. As a consequence, when a user has 
many followers, even using Apigee proxy, the recov- 
ering of the social graph is slow. Therefore, there is 
a risk that the observed graph had changed between 
the moment when the original tweet was sent and 
the instance in which the underlying social graph of 
followers and friends was generated. For this reason 
we opted to reduce the number of tweets for further 
analysis and instead of studying the 3 million tweets, 
we randomly selected a subsample of 400,000 tweets. 

Second, for each tweet in the dataset, we verified 
its status through the Twitter REST API, recovering 
the numbers of retweets that it received (if any) and 
the retweeter's ids for each tweet. We have taken 
a conservative approach to the process of informa- 
tion diffusion, specifically, we have focused our at- 
tention on native mechanisms of retweeting, which 
means we will consider as a retweeted tweet any tweet 
that Twitter API identifies as such. We selected this 
operationalization to observe cases of pure contagion. 
Recent work j2J has suggested broader definitions for 
retweeting behavior and it is a matter of future re- 
search to test whether the results presented here hold 
in those cases. 

The result gave us a total of 13,946 retweeted 
tweets (3.5%). This proportion is consistent with 
those found in previous studies |3|. Approximately, 
76.4% of the tweets were retweeted only once; 12,6% 
twice and 11% three or more times. As expected, 
we found a clear association between tweeters num- 
ber of tweets and their number of followers (figure 
la) and between the number of followers and amount 
of retweets (figure lb); although, these data suggest 
that more active tweeters are not necessarily more 
retweeted than less active ones (figure 2c). 

Third, for each retweeted tweet, we rebuilt the so- 
cial graph of friends and followers linking the tweeter 
with each retweeter using Twitter REST API. There- 
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Figure 1: Tweets, Retweets and Followers 
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fore, we generated 13,946 social subgraphs, one per 
retweeted tweets. Then, for each social graph, we 
compute the eccentricity of the tweeters tweet (i.e 
the longest geodesic connecting each tweeterer with 
the set of the retweeters of her tweet). As shown in 
figure 2 to 5, there were variation in eccentricities and 
number of retweets. 



Figure 2: 1 degree 95 RTs 




Figure 3: 1 degree 3 RTs 




3 Results 

Remarkably, in a network of followers, the social dis- 
tances travelled by retweeted tweets are in the same 
range found for other phenomena in the literature of 
social contagion. Nevertheless, friction in the propa- 
gating mechanism and instability of ties should not be 
conditioning the spread of contagion in this case. We 
found that 87% of retweeted tweets (12,126 tweets) 
were retweeted at one degree of distance, 7% (965 
tweets) were retweeted up to two degrees of distance, 
1% (156 tweets) travelled up to the third degree of 
distance and 0.5% of retweeted tweets were resent to 
farther distances including one tweet that traveled 
up to the ninth degree of distance via retweets. In 



Figure 4: 2 degrees 49 RTs 




Figure 5: 2 degrees 3 RTs 




the remaining 4.6% of cases, there is a mixed pat- 
tern where some retweets were made by unconnected 
accounts (figure 6). Hence, even in cases with a sig- 
nificant number of retweets, the audience remain fun- 
damentally local from a social structural perspective. 

On the other hand, as shown in figure 7, there 
is a positive although weak relationship between the 
number of retweets and the social distance travelled 
by a tweet. We also analyzed smaller samples of 
tweets and performed basic statistical analyses to ver- 
ify our reading of the data and found the same results. 

4 Discussion 

Recent work has found that the propagation of be- 
haviors and sentiments through networks extends in 
ranges up to 2 to 4 degrees of distance. This find- 
ing was so remarkable that Christakis and Fowler |9] 
decided to give it a name: the three degree of in- 
fluence rule. From the perspective of a theory of 
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Social distance between tweeterer and farthest retweeterer 

Figure 6: Eccentricities of retweeted tweets 



contagion, propagation does not extend to greater 
distances because either the mechanism of diffusion 
becomes weaker or the dynamic of Hnk formation and 
destruction negatively affects the reachabihty of in- 
dividuals located at greater distances.. We have pro- 
posed to measure the propagation of a behavior along 
a social network in circumstances in which neither of 
these constraints should be binding and found, con- 
trary to our expectation, that the diffusion remains 
in the same range found in the literature. 

One alternative explanation, based on homophily, 
would suggest that the decline at further distances is 
simply because dissimilar individuals are located at 
larger social distances from each other. Such a possi- 
bility would be consistent with a practice of retweet- 
ing as an expression of homophily. However, recent 
workjl3l offers evidence in the opposite direction sug- 
gesting greater levels of anti-homophily in retweeting 
behaviors. Further research is required to explain 
why social contagion usually stops up to the third 
degree of distance. 



References 




^ 1 1 1 1 1 

4-8 
Ali tweets wy3+ RTs 



i 



T 1 1 1 1 1 T 

1.0 1.5 2.0 2.5 3.0 3.5 4.0 

social distance 

Figure 7: Relationship between retweets and social 
distance with 3-1- RTs 



[1] Sinan Aral (2012): Identifying Influential and 
Susceptible Members of Social Networks. Sci- 
ence. Vol. 337 no. 6092 pp. 337-341. 

[2] Norhidayah Azman, David Millard, Mark 
Weal (2011): Patterns of Implicit and Non- 
follower Retweet Propagation: Investigating 
the Role of Applications and Hashtags Pro- 
ceedings of the ACM WebSci'll. Available at 
I ht t p : / / j ournal . webscience . or g/ 5 1 7/ , 

[3] Bongwon Suh, Lichan Hong, Peter Pirolli, and 
Ed H. Chi (2010): Want to be Retweeted? Large 
Scale Analytics on Factors Impacting Retweet 
in Twitter Network. 2010 IEEE Second Interna- 
tional Conference on Social Computing (Social- 
Com). 

[4] John Cacioppo, James Fowler and Nicholas 
Christakis (2009): Alone in the crowd: The 
structure and spread of loneliness in a large so- 



5 



cial network. Journal of Personality and Social 
Psychology, Vol 97(6) 977-991 

[5] Rumi Ghosh, Tawan Surachawala and Kristina 
Lerman (2011): Entropy-based classification of 
Retweeting activity on Twitter. Available at 
jhttp://arxiv.org/abs/1106.0346, 

[6] Nicholas Christakis and James Fowler (2007): 
The Spread of Obesity in a Large Social Net- 
work over 32 Years, The New England Journal 
of Medicine, 357:370-379 

[7] Nicholas Christakis and James Fowler (2008): 
The Collective Dynamics of Smoking in a Large 
Social Network. The New England Journal of 
Medicine, 358:2249-2258 

[8] Nicholas Christakis and James Fowler (2008): 
Estimating Peer Effects on Health in Social Net- 
works : A Response to Cohen-Cole and Fletcher; 
Trogdon, Nonnemaker, Pais. J Health Econ. 
September; 27(5): 14001405. 

[9] Nicholas Christakis and James Fowler (2009): 
Connected: The Surprising Power of Our Social 
Networks and How They Shape Our Lives. Little 
Brown, New York. 

[10] Nicholas Christakis and James Fowler (2011) So- 
cial Contagion Theory: Examining Dynamic So- 
cial Networks and Human Behavior. 

[11] James Fowler and Nicholas Christakis (2008): 
Dynamic spread of happiness in a large social 
network: longitudinal analysis over 20 years in 
the Framingham Heart Study. British Medical 
Journal; 337: a2338. 

[12] Lyons, R. (2010). The spread of evidence-poor 
medicine via flawed social-network analysis. 
Available at: ,http: / /arxiv.org/ abs/1007.2876( 

[13] Sofus Macskassy, Matthew MIchelson (2011): 
Why Do People Retweet? Anti-Homophily Wins 
the Day. Proceedings of the Fifth International 
AAAI Conference on Weblogs and Social Media. 



[14] Hans Noel and Brendan Nyhan, "The "Un- 
friending" Problem: The Consequences of Ho- 
mophily in Friendship Retention for Causal Esti- 
mates of Social Influence" , Social Networks forth- 
coming (2011), Available at a rXiv:1009.3243| 

[15] Niels Rosenquist, Joanne Murabito, James H. 
Fowler and Nicholas Christakis (2010): The 
Spread of Alcohol Consumption Behavior in 
a Large Social Network. Annals of Internal 
Medicine. 152:1-36 

[16] DA Shoham, L .Tong, PJ Lamberson, AH 
Auchincloss, J Zhang, et al. (2012) An Actor- 
Based Model of Social Network Influence on 
Adolescent Body Size, Screen Time, and 
Playing Sports. PLoS ONE 7(6): e39795. 
doi:10.1371/journal.pone.0039795 

[17] Diederik van Liere (2010): How far does a tweet 
travel? Information brokers in the Twitterverse. 
MSMIO, Toronto. 



6 



